Ad-hoc Limited Scale-Free Models for Unstructured Peer-to-Peer Networks 
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Abstract 

Several protocol efficiency metrics (e.g., scalability, 
search success rate, routing reachability and stability) de- 
pend on the capability of preserving structure even over 
the churn caused by the ad-hoc nodes joining or leaving 
the network. Preserving the structure becomes more pro- 
hibitive due to the distributed and potentially uncoopera- 
tive nature of such networks, as in the peer-to-peer (P2P) 
networks. Thus, most practical solutions involve unstruc- 
tured approaches while attempting to maintain the struc- 
ture at various levels of protocol stack. The primary fo- 
cus of this paper is to investigate construction and mainte- 
nance of scale-free topologies in a distributed manner with- 
out requiring global topology information at the time when 
nodes join or leave. We consider the uncooperative behav- 
ior of peers by limiting the number of neighbors to a pre- 
defined hard cutoff value (i.e., no peer is a major hub), and 
the ad-hoc behavior of peers by rewiring the neighbors of 
nodes leaving the network. We also investigate the effect of 
these hard cutoffs and rewiring of ad-hoc nodes on the P2P 
search efficiency. 



1. Introduction 

Stability and scalability of highly dynamic networks 
mainly depends on the capability of preserving structure 
even over the churn caused by the ad-hoc nodes joining 
or leaving the network. Several protocol efficiency metrics 
(e.g., search success rate, routing reachability rate) depend 
on this capability. Preserving the structure becomes more 
prohibitive due to the distributed and potentially uncooper- 
ative nature of such networks, as in the peer-to-peer (P2P) 
networks. Thus, most practical solutions involve unstruc- 
tured approaches while attempting to maintain the structure 
at various levels of protocol stack. 

In decentralized P2P networks, the overlay topology (or 
connectivity graph) among peers is a crucial component in 
addition to the peer/data organization and search. Topolog- 
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ical characteristics have profound impact on the efficiency 
of search on P2P networks as well as other networks. It has 
been well-known that search on small-world topologies can 
be as efficient as 0(ln N) [30], and this phenomenon has re- 
cently been studied on P2P networks (43] [35 28 29\. The 
best search efficiency in realistic networks can be achieved 
when the topology is scale-free (power-law), which offers 
search efficiencies like O(lnlniV). Key limitation of scale- 
free topologies is the high load (i.e., high degree) on very 
few number of hub nodes. In a typical unstructured P2P net- 
work, peers are not willing to maintain high degrees/loads 
as they may not want to store large number of entries for 
construction of the overlay topology. So, to achieve fairness 
and practicality among all peers, hard cutoffs on the num- 
ber of entries are imposed by the individual peers, which 
makes the overall network a "limited" one. Effect of such 
hard cutoffs on search efficiency can be significant l27l . 

Due to the uncooperative nature of peers in a P2P net- 
work, protocols cannot completely rely on methods work- 
ing with full cooperation of peers. For example, peers may 
not want to store large number of entries for construction of 
the overlay topology, i.e., connectivity graph. Even though 
characteristics of the overlay topology is crucial in deter- 
mining the efficiency of the network, peers typically do 
not want to take the burden of storing excessive amount 
of control information for others in the network, thereby 
imposing hard cutoffs on the amount of control informa- 
tion to be stored. Yet another key issue is the construction 
of scale-free overlay topologies without global information. 
There are several techniques to generate a scale-free topol- 
ogy by using global information about the current 
network when a node joins or leaves. Such global methods 
are not practical in P2P networks, and local heuristics in 
generating such scale-free overlay topologies must be em- 
ployed. In other words, there must be local and simple op- 
erations when peers are joining or leaving the P2P overlay, 
and also causing a minimal inefficiency to the search mech- 
anisms to be run on the network. 

The primary focus of this paper is to investigate con- 
struction and maintenance of scale-free topologies in a dis- 



tributed manner without requiring global topology informa- 
tion at the time when nodes join or leave. We consider the 
uncooperative behavior of peers by limiting the number of 
neighbors to a pre-defined hard cutoff value, and the ad-hoc 
behavior of peers by rewiring the neighbors of nodes leav- 
ing the network. We also investigate the effect of these hard 
cutoffs and the rewiring of ad-hoc nodes on the P2P search 
efficiency. 

The rest of the paper is organized as follows: First, we 
provide motivation for this work, outline the key parame- 
ters to be considered, and briefly state the major contribu- 
tions and findings of the work. Then, we survey the previ- 
ous work on P2P networks in Section [2] In Section [3] we 
survey the previous work on scale-free topology generation 
and briefly cover the importance of cutoff in the scale-free 
network and Preferential Attachment (PA) with Hard Cut- 
offs. We introduce our topology generation techniques us- 
ing local heuristics and briefly describe the algorithm show- 
ing join and leave process of a node in the growing network, 
in Section |4] In Section [5] we present our simulation re- 
sults of degree distribution of the nodes. We also discuss 
the efficiency of three search algorithms i.e Flooding (FL), 
Normalized Flooding (NF) and Random Walk (RW) on the 
topology generated by our simulations. We conclude by 
summarizing our current work and outlining the directions 
for the future work in Section [6] 

1.1. Contributions and Major Results 

Our work uncovers the relationship between the ad-hoc 
behavior of peers (i.e., how frequent they join/leave) and the 
efficiency of search over an overlay topology where each 
peer can (or is willing to) store a maximum number of links 
to other peers. In our model, we parameterize (i) the ad-hoc 
behavior of nodes by the probability that a node leaves /i, 
(ii) the amount of local information to be used at the time of 
join by knowledge radius from the point the node attempts 
to join, Tj (i.e., the node knows about the local topology 
covering Tj hops away from the point the node attempts to 
join the network), and (iii) the amount of local information 
to be used at the time of leave by knowledge radius from 
the location of the leaving node, 77 (i.e., each neighbor of 
the leaving node knows about the local topology covering 
77 hops away from itself). We also define the maximum 
number of links to be stored by peers as the hard cutoff, 
k c , for the degree of a peer in the network as compared to 
natural cutoff which occurs due to finite-size effects. Our 
contributions include: 

• Guidelines for generating scale-free topologies over 
ad-hoc nodes: We introduce a generic model that 
can assign availability of different amount of local 
topology information at the times when a node joins 



or leaves. Our model provides a way of tuning ad- 
hocness of the network and studying how to balance 
state information for nodes joining or leaving. 

• Search efficiency on ad-hoc limited scale-free topolo- 
gies: Through extensive simulations, we studied effi- 
ciency of Flooding (FL), Normalized Flooding (NF), 
and Random Walk (RW) on the topologies generated 
by our model with different fi, Tj, 77, and k c values. 

• Rewiring methodologies for designing peer leave al- 
gorithms for unstructured P2P networks: Our study 
yielded several guidelines for peers leaving an unstruc- 
tured P2P network, so that the search performance of 
the overall overlay topology remains high. 

Our study revealed several interesting issues. We found 
that having more global information about the topology at 
the time of leave is significantly more helpful than having 
it at the time of join. We show that the degree distribution 
can be kept scale-free and the search efficiency can be kept 
very high by simply keeping 77 at reasonably high values, 
e.g., 2-3. 

2. Related Work 

Previous work on P2P network protocols can be classi- 
fied into centralized and decentralized ones. As centralized 
P2P protocols (e.g. Napster [3|) proved to be unscalable, 
the majority of the P2P research has focused on decentral- 
ized schemes. The decentralized P2P schemes can be fur- 
ther classified into sub-categories: structured, unstructured, 
and hybrid. 

In the structured P2P networks, data/file content of peers 
is organized based on a keying mechanism that can work in 
a distributed manner, e.g. Distributed Hash Tables (DHTs) 
[38]. The keying mechanism typically maps the peers (or 
their content) to a logical search space, which is then lever- 
aged for performing efficient searches. In contrast to the 
structured schemes, unstructured P2P networks do not in- 
clude a strict organization of peers or their content. Since 
there is no particular keying or organization of the content, 
the search techniques are typically based on flooding. Thus, 
the searches may take very long time for rare items, though 
popular items can be found very fast due to possible lever- 
aging of locality of reference [ 37 1 and caching/replication 

C3. 

The main focus of the research on unstructured P2P net- 
works has been the tradeoff between state complexity of 
peers (i.e., number of records needed to be stored at each 
peer) and flooding-based search efficiency. The minimal 
state each peer has to maintain is the list of neighbor peers, 
which construct the overlay topology. Optionally, peers can 
maintain forwarding tables (also referred as routing tables 



in the literature) for data items in addition to the list of 
neighbor peers. Thus, we can classify unstructured P2P net- 
works into two based on the type(s) of state peers maintain: 
(i) per-data unstructured P2P networks (i.e., peers maintain 
both the list of neighbor peers and the per-data forward- 
ing table), and (ii) non-per-data unstructured P2P networks 
(i.e., peers maintain only the list of neighbor peers). 

Non-per-data schemes are mainly Gnutella-like schemes 
(l], where search is performed by means of flooding query 
packets. Search performance over such P2P networks has 
been studied in various contexts, which includes pure ran- 
dom walks 1221, probabilistic flooding techniques ||34ll23ll , 
and systematic filtering techniques iRTI . 

Per-data schemes (e.g. Freenet ||2)) can achieve bet- 
ter search performances than non-per-data schemes, though 
they impose additional storage requirements to peers. By 
making the peers maintain a number of <key,pointer> en- 
tries peers direct the search queries to more appropriate 
neighbors, where "key" is an identifier for the data item be- 
ing searched and the "pointer" is the next-best neighbor to 
reach that data item. This capability allows peers to leverage 
associativity characteristics of search queries fl2l . Stud- 
ies ranged from grouping peers of similar interests (i.e., 
peer associativity) 11291 [121 to exploiting locality in search 
queries (i.e., query associativity) lfTTl[37l . Our work is ap- 
plicable to both per-data and non-per-data unstructured P2P 
networks, since we focus on the interactions between search 
efficiency and topological characteristics. 

Previous study on node isolation caused by churn in un- 
structured P2P networks introduced a general model of re- 
silience ll42l . In this study, joining and rewiring processes 
were based on age-biased neighbor selection, where a for- 
mal analysis included two age-biased techniques of neigh- 
bor selection. In maximal age-selection approach, the join- 
ing node selects uniformly randomly m alive nodes from 
the network and connects to the one with maximal age. 
It follows the same process when a dead link is detected. 
However, in age-biased random walk selection approach, 
the probability of a node to be selected by another peer is 
proportional to its current age. Another study introduced 
self-organizing super peer network architecture ll2D . where 
super peers maintain the cache with pointers to files that are 
recently requested and on the other hand client peers dy- 
namically select super peers offering best search results. 

3. Scale-Free Network Topologies 

Recent research shows that many natural and artificial 
systems such as the Internet l20ll . World Wide Web 0, sci- 
entific collaboration network [9], and e-mail network lfl9l 
have power-law degree (connectivity) distributions. These 
systems are commonly known as power-law or scale-free 
networks since their degree distributions are free of scale 



(i.e., not a function of the number of network nodes N) and 
follow power-law distributions over many orders of magni- 
tude. This phenomenon has been represented by the proba- 
bility of having nodes with k degrees as P(fc)~fc -7 where 
7 is usually between 2 and 3 [8]. Scale-free networks have 
many interesting properties such as high tolerance to ran- 
dom errors and attacks (yet low tolerance to attacks targeted 
to hubs) [6 1, high synchronizability [25, 26, 31], and resis- 
tance to congestion 1 39 1 . 

The origin of the scale-free behavior can be traced back 
to two mechanisms that are present in many systems, and 
have a strong impact on the final topology [ 8 ] . First, net- 
works are developed by the addition of new nodes that are 
connected to those already present in the system. This 
mechanism signifies continuous expansion in real networks. 
Second, there is a higher probability that a new node is 
linked to a node that already has a large number of connec- 
tions. These two features led to the formulation of a grow- 
ing network model first proposed by Barabasi and Albert 
that generates a scale-free network for which P(k) follows 
a power law with 7=3. This model is known as preferen- 
tial attachment (PA or rich-gets-richer mechanism) and the 
resulting network is called Barabasi- Albert network OH). 

In this study, we use a simple version of the PA model 
0. The model evolves by one node at a time and this new 
node is connected to m (number of stubs) different exist- 
ing nodes with probability proportional to their degrees, i.e., 
Pi—ki/J2j kj where fcj is the degree of the node i. The av- 
erage degree per node in the resulting network is 2m and 
the minimum degree is m. 

Scale-free networks are very robust against random fail- 
ures and attacks since the probability to hit the hub nodes 
(few nodes with very large degree) is very small and attack- 
ing the low-degree satellite nodes does not harm the net- 
work. On the other hand, deliberate attacks targeted to hubs 
through which most of the traffic go can easily shatter the 
network and severely damage the overall communication in 
the network. For the same reason the Internet is called "ro- 
bust yet fragile" [ 18 1 or "Achilles' heel" EEL 

Scale-free networks also have small-world [40] proper- 
ties. In small-world networks the diameter, or the mean 
hop distance between the nodes scales with the system size 
(or the number of network nodes) N logarithmically, i.e., 
d~ln N . The scale-free networks with 2<7<3 have a much 
smaller diameter and can be named ultra-small networks 
lfl4l . behaving as d^lnln N. When 7=3 and m>2, d be- 
haves as d~ln N/ In In N. However, when m=l and 7=3 
the Barabasi-Albert model turns into a tree and d~ln N is 
obtained. Also when 7>3, the diameter behaves logarithmi- 
cally as d~ln N. Since the speed/efficiency of search algo- 
rithms strongly depend on the average shortest path, scale- 
free networks have much better performance in search than 
other random networks. 



3.1. The Cutoff 



One of the important characteristics of scale-free net- 
works is the natural cutoff on the degree (or the maximum 
degree) due to finite-size effects. Natural cutoff can be de- 
fined as Ifl6| the value of the degree above which one ex- 
pects to find at most one vertex, i.e., 

/>oo 

N / P(k)dk ~ 1 . (1) 

By using the degree distribution for the scale-free net- 
work and the exact form of probability distribution (i.e., 

P(k) — (j — l)m T_1 /fc 7 ), one obtains 

fcnc(iV) -mN 1 /^, (2) 

which is known as the natural cutoff of the network. The 
scaling of the natural cutoff can also be calculated by using 
the extreme-value theory iflOl . For the scale-free networks 
generated by PA model (7=3) the natural cutoff becomes 

k nc (N) ~ mVN. (3) 

3.2. Preferential Attachment with Hard Cutoffs 

The natural cutoff may not be always attainable for most 
of the scale-free networks due to technical reasons. One 
main reason is that the network might have limitations on 
the number of links the nodes can have. This is especially 
important for P2P networks in which nodes can not possibly 
connect many other nodes. This requires putting an artificial 
or hard cutoff k c to the number of links one node might 
have. 

In order to implement the hard cutoff in PA, we simply 
did not allow nodes to have links more than a fixed hard 
cutoff value during the attachment process. This modified 
method generates a scale-free network in which there are 
many nodes with degree fixed to hard cutoff instead of a 
few very high degree hubs and the degree distribution still 
decays in a power law fashion. The degree distribution of 
PA model with cutoff is slightly different than that of PA 
without a cutoff in terms of exponent and an accumulation 
of nodes with degree equal to hard cutoff. PA model, in 
its original form, has a degree distribution exponent 7=3 
for very large networks. However, when a hard cutoff is 
imposed it is observed that the absolute value of degree dis- 
tribution exponent decreases ll27l . 

One can use the master-equation ll32l approach to ana- 
lyze the effects of the hard cutoff on the topological char- 
acteristics. We grow the network by introducing new nodes 
one by one for simplicity. Each new node links to m earlier 
nodes in the network. The probability that the new node at- 
taches to a previous node of degree k is defined to be A^/A, 



Algorithm 1 Network growth using paramaterized join and 
leave processes 

//Global Variables and Functions 
to - minimum degree 

p - probability of a node to leave the network 

N - the maximum node ID of the existing network ( the 

minimum node ID is 0) 

G - graph of the existing network of M links and N nodes 
PreferentialAttachment(Gi, G 2 ) - a function that per- 
forms Preferential Attachment to Gi by using the nodes in 
Gi, returns the number of successful new links 

// Join process of node i 
void Join(i, rf) 
1: N++ 

2: numof links «— 

3: while numof links < m do 

4: N r and *— Randomize( 1 , N) {Pick a random node 
from the existing network} 

5: myG «— get_subgraph(N ran d,Tj) {Get the sub- 
graph including neighbor nodes of N ran d up to Tj 
hops away} 

6: numof links += PreferentialAttachment(G, myG) 
7: end while 

//Leave process of node i 
void Leave(i, 77) 
1: myG *— getsubgraph(N ram i,Ti) {Get the subgraph 
including neighbor nodes of N ran d up to 77 hops away} 
2: remove(N ran d) {Delete N ran d from the existing net- 
work} 

3: N = N — 1 

4: PreferentialAttachment(G,777yG) 

// Growth process of a network with Ntarget nodes, param- 
eterized with Tj and 77 
void Grow(N target , r jt r\) 

1: for i=777+l; \<N ta rget\ i++ do 

2: Join(N,r,) 

3 : 7777777, <— Random(0,l) 

4: if N == Ntarget then 

5: break; 

6: end if 

7: if num < (1 then 

8: N d ei «— Randomize(l,N) 

9: Leave(Af de/ , 77) 

10: end if 

li: end for 



where A k is the rate of attachment to a previous node and 
this rate depends only on the degree of the target node, while 
A = X^fcLm Ak.Nk is the total rate for all events, and Nk is 
the number of nodes of degree k in the network. Thus A k /A 
equals to the probability for the newly-introduced node to 
attach to a node of degree k. The new feature that we study 
is the effect of a hard cutoff on the degree of each node. 
Once the degree of a node reaches fc c , it is defined to be- 
come inert so that no further attachment to this node can oc- 
cur. Thus only nodes with degrees fc = m,m+l,...,fc c — 1 
are active. This restriction is the source of the cutoff in the 
definition of the total attachment rate. We now study the 
degree distribution, Nk(N), as a function of the cutoff fc c 
and the total number of nodes in the network N. 

The master equations for the degree distribution can be 
written by using the fact that Nk is proportional to N, and 
thus Nk — > Nnk as well as A — > vN as 



n k = 



mn m I 

u 

(fc-l)ra fc _!— fcn fc 
v 

(fee— jjrafcc-l 



k = m 
k = to 



l,...,k c - 1 



(4) 



By the nature of these equations, it is evident that rik c is 
of a different order than n k with k < k c . Starting with 
the solution n m — v/(m + v), we can find by sub- 
sequent substitutions. This recursive approach gives us a 
chance to write n k values as products 1 32 ] and by convert- 
ing these products into Euler gamma functions we show that 
rik c scales as k~ u , while for k < k c , n k scales as k~( v+1 \ 
We can obtain the coefficient v in A = vN self consis- 
tently from A = Ylk=m Akn-k = vN, or equivalently, 
v = J2k*=m Akn>k- By rewriting the sum above as a dif- 
ference between two sums with limits from the minimum 
degree to oo and from cutoff to oo and by taking asymptotic 
limits 11241 of large iV and k c we get 



2m 



(5) 



This result shows that ~ j.-(3-Jm/y f or £ < fc c an( j 
nk a ~ ^~( 2_2m / fc c) confining me change in the degree 
distribution exponent ll27ll . This implies that any finite hard 
cutoff value decreases the degree distribution exponent, i.e., 
it makes the degree distribution flatter. A better search ef- 
ficiency observed for a smaller cutoff can be explained by 
the increase in the degree distribution exponent (27]. 

Ad-hoc scale-free networks have recently attracted con- 
siderable attention in the literature mainly because of its 
most-desired property of robustness to random attacks or 
failures. For example it was shown that [6| the diameter of 
the Internet at the autonomous system level, which is the 
most famous example of scale-free networks, would not be 
changed considerably if up to 2.5% of the routers were re- 
moved randomly. This is an order of magnitude larger than 




Figure 1. Search strategies: (a) Flooding (b) 
Normalized flooding (c) Random walk 



the failure rate. It was also shown in |6 1 that for a scale-free 
network of size 10,000 and a failure rate of 18%, the biggest 
connected component holds 8,000 nodes, whereas under the 
same conditions a random network can survive this failures 
by the biggest connected component of size 100. 

Many models for ad-hoc scale-free networks in which 
the edges can appear and disappear |fT31 |33l [TTl or some 
nodes are removed [36] have been studied. In the first set 
of studies as the nodes are joining to the network some 
links among the pre-existing nodes are rewired or moved 
randomly by some probability parameter. Depending on 
the parameters such models exhibit either exponential or 
power-law degree distributions. In (36), as the nodes are 
joining by preferential attachment some randomly selected 
nodes are deleted along with their links from the network. 
If the nodes whose neighbors are deleted do not reconnect 
themselves to other nodes, it is observed that the degree dis- 
tribution is a power law with an exponent ranging from 3 to 
infinity depending on the deletion probability. The authors 
proposed a remedy for the deletion that the neighbors of the 
deleted nodes select some other nodes in the network and 
connect by again using preferential attachment rules. In this 
case the degree distribution is still a power law but the ex- 
ponent changes from 3 to 2 as the deletion probability goes 
from to 1 . 

The main disadvantage of the ad-hoc scale-free models 
in the literature is that they lack localized algorithmic so- 
lutions. All requires global information to be available to 
nodes so that they can reconnect to randomly selected nodes 
in the network. For this reason we grow scale-free networks 
with local heuristics only to simulate the real-life situation 
in unstructured peer-to-peer systems. To parameterize our 
model we use two different time-to-live variables: Tj and 
ti to describe the number of nodes available to a new node 
and to a neighbor of a deleted node, respectively. In the next 
section we explain our model and its parameters in detail. 
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Figure 2. Degree distributions when there is no ad-hocness (i.e., //=0): /'(//) for various networks 
generated by our framework for varying Tj. 



4. Growing Scale-Free Topologies with Local 
Heuristics 

In the PA model and its ad-hoc variants as outlined in the 
previous section, the new node or the neighbor of a deleted 
node has to make random attempts to connect to the existing 
nodes with a probability depending on the degree of the ex- 
isting nodes. To implement this in a P2P (or any distributed) 
environment, the new node has to have information about 
the global topology (e.g. the current number of degrees each 
node has for the PA model), which might be very hard to 
maintain in reality. Thus, in order for a topology construc- 
tion mechanism to be practical in P2P networks, it must al- 
low joining or rewiring of the nodes by just using locally 
available information. Of course, the cost of using only 
local information is expected to be loss of scale-freeness 
(or any other desired characteristics) of the whole overlay 
topology, which will result in loss of search efficiency in 
return. In this section, we present a practical method using 
local heuristics and no global information about the topol- 
ogy. This model imitates the method for finding peers in 
Gnutella-like unstructured P2P networks. 

In our model, starting with some m + 1 fully connected 
nodes, at every time step a new node with m possible links 
is added to the network and one randomly chosen node 
is deleted with probability /i. Since the nodes are short- 



sighted, i.e., they do not have global information about the 
network, they can only choose from a subset of the network 
(horizon) they construct instead of the whole network. The 
parameter Tj and 77 are the TTL values used by the nodes 
and denote the measure of locality in joining and leaving, 
respectively. A newly added node, first, select a random 
existing node and construct a set of nodes reachable in Tj 
hops or less from that node. Then, this new node randomly 
selects a node from this set and connects itself with proba- 
bility proportional to its degree. This probability is normal- 
ized by the total degree of the nodes in the set. The new 
node randomly selects other nodes in the set until its degree 
reaches m. If no node is left in the set to connect but the 
degree of the new node is less than m, it selects another 
random node from the network and continue this process. 
In the deletion case, the neighbors of the deleted node se- 
lects a node randomly from a set of nodes reachable in 77 or 
less steps from the deleted node and connect by using the 
preferential attachment rule. Here, in both cases nodes can- 
not connect to other nodes with degrees equal to the hard 
cutoff. 

There are special cases in this model: i) when Tj = 0, 
the horizon of the new nodes contain only the randomly se- 
lected nodes and the preferential attachment rule is invalid. 
In this case the new node connect to this single node in the 
horizon if its degree is less than the hard cutoff, ii) when 



t; = the neighbors of the deleted node do not have any 
node in their horizons so no rewiring occurs. These nodes 
just loses one of their links and they do nothing to com- 
pensate it. The model typically becomes the preferential 
attachment with global information when Tj value is large 
and ti is zero and a BA network with 7 = 3 is obtained. 

5. Simulations 

In the previous sections, we introduced a framework to 
investigate the effects of join and leave processes in terms 
of scale-freeness of the topology being constructed within 
the context of ad-hoc unstructured P2P networks. Here, we 
study a number of message-passing algorithms that can be 
efficiently used to search items in P2P networks utilizing 
the scale-free degree distribution in sample networks gener- 
ated by our topology construction algorithms. These search 
algorithms are completely decentralized and do not use any 
kind of global knowledge about the network. We consider 
three different search algorithms: flooding (FL), normalized 
flooding (NF), and random walk (RW). 

Goals of our simulation experiments include: 

• Effect of ad-hocness on the search efficiency in an 
uncooperative environment with hard cutoffs: Ad- 
hocness of nodes joining or leaving the network affects 
the search efficiency, i.e., number of hits per unit time. 
Further, applying hard cutoffs on such ad-hoc scale- 
free topologies reduces the degree distribution expo- 
nent. We are interested in observing the effect of this 
ad-hocness and hard cutoffs on the search efficiency 
for three search algorithms, i.e., FL, NF, and RW. This 
extends our previous work in [27|, which focussed on 
the effect of hard cutoffs only. 

• Ad-hoc scale-free topology construction with global 
vs. local information: Though we showed in the pre- 
vious section that using local information when a peer 
is joining yields a less scale-free topology, the effect of 
this on search efficiency still needs to be shed light on. 
Our simulations aim to investigate this too. 

5.1. Search Algorithms 

We use three search techniques to evaluate our ad-hoc 
scale-free topologies: 

Flooding (FL): FL is the most common search algorithm 
in unstructured P2P networks. In search by FL, the source 
node s, sends a message to all its nearest neighbors. If 
the neighbors do not have the requested item, they send on 
to their nearest neighbors excluding the source node [see 
Fig. 02 a )]- This process is repeated a certain number of 
times, which is usually called time-to-live (TTL). 



Table 1. Parameters of our topology con- 
struction framework 



Symbol 


Parameter Description 


Range 


A* 


Ad-hocness of the nodes 


[0,D 


T o 


Available information at join 


> 


n 


Available information at leave 


> 


k c 


Hard cutoff 


> 1 


m 


Minimum degree (# of stubs) 


> 1 



Normalized Flooding (NF): In NF, the minimum degree m 
in the network is an important factor. NF search algorithm 
proceeds as follows: When a node of degree m receives a 
message, the node forwards the message to all of its neigh- 
bors excluding the node forwarded the message in the previ- 
ous step. When a node with larger degree receives the mes- 
sage, it forwards the message only to randomly chosen m of 
its neighbors except the one which forwarded the message. 
The NF mechanism is illustrated in Fig.QJb). In this simple 
network with m — 2, the source node sends a message to 
its randomly chosen two neighbors and these neighbors for- 
ward the message to their randomly chosen two neighbors. 
In the third step, the message reaches its destination. 

Random Walk (RW): RW or multiple RWs have been used 
as an alternative search algorithm to achieve even better 
granularity than NF. In RW, the message from the source 
node is sent to a randomly chosen neighbor. Then, this ran- 
dom neighbor takes the message and sends it to randomly 
selected one of its random neighbors excluding the node 
from which it got the message. This continues until the des- 
tination node is reached or the total number of hops is equal 
to TTL. A schematic of RW can be seen in Fig. 02c)- RW 
can also be seen as a special case of FL where only one 
neighbor is forwarded the search query, providing the other 
extreme situation of the tradeoff between delivery time and 
messaging complexity. 

5.2. Results 

We simulated the three search algorithms FL, NF, and 
RW on the topologies generated by our framework with 
three different parameters: (i) ad-hocness, < /x < 1, (ii) 
available information during join, Tj > 0, (iii) available in- 
formation during leave, 77 > 0, (iv) hard cutoff, fc c > 1, 
and (v) minimum degree (number of stubs), m. These pa- 
rameters are listed in Table[TJas well. By assigning different 
values to each of these parameters, we generated topologies 
with 10000 nodes. We used different k c values from 10 to 
100 (or just a few in this range), in addition to the natural 
cutoff, i.e., no hard cutoff. We varied Tj and t; from to 3. 
Minimum degree values (or m) in our topologies were 1, 2, 
or 3. We studied smaller values of [i from to 0.3, reflect- 
ing no churn to 30% churn, respectively. We performed 5 
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Figure 3. Degree distributions over ad-hoc nodes (i.e., ^=0.3): P(k) for various networks generated 
by our framework for varying t 3 and n. 
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Figure 4. Flooding (FL) performance over 
topologies with m=3 and no ad-hocness. 

realizations of our results. 

We varied the TTL (i.e., time-to-live) values of search 
queries in FL and NF to the point we reach the system size. 
To compare search efficiencies of RW and NF fairly, we 
equated TTL of RW searches to the number of messages 
incurred by the NF searches in the same scenario. Thus, 
for the search efficiency graphs of RW when TTL is equal 
to a particular value such as 4, this means that the number 
of hits corresponding to that TTL=4 value is obtained by 
simulating a RW search with TTL equal to the number of 
messages that were caused by an NF search using a TTL 
value of 4. A similar normalization was done in 

5.2.1 Effects on Degree Distribution 

Our simulation results show the effect of ad-hocness and 
hard cutoff on the degree distribution of the topologies. Fig- 
ure [2] shows the degree distribution of the topologies when 
there is no ad-hocness, i.e., /i=0. Similarly, Figure[3]shows 
the degree distributions when the nodes are ad-hoc with 
/i=0.3. It is known that using more global information (i.e., 



knowing more of the network topology) helps to generate 
better scale-free topologies with lower power-law exponent. 
This phenomenon is clearly shown in Figured i.e., the de- 
gree distribution shifts from an Exponential one to a power- 
law one as Tj increases from to 2 (i.e., the joining node 
uses the topology information at a larger horizon). This is 
true for both m=3 and m= 1 , though larger m makes the shift 
a little less apparent. Further, the hard cutoff, k c , only af- 
fects this process by simply bounding the very large hubs to 
the cutoff without affecting the transition from Exponential 
to power-law. 

An interesting result being revealed in Figure [3] is that 
t; has much more significant effect than Tj in shifting the 
degree distribution from Exponential to power-law. This is 
even more apparent for smaller values of the cutoff. 



5.2.2 Effects on Search Efficiency 

In flooding by far the most important parameter when there 
is no deletion in the network is the cutoff which determines 
the number of distinct nodes one can reach from a node, 
see Figure|U In this case, tj is also an important parameter 
which changes the network from an exponential to a scale- 
free one and give better efficiency in flooding. 

Our simulations also show that this effect can be relieved 
by increasing the minimum degree in the network as it can 
be seen in Figure|5] More interestingly, ad-hocness plays an 
important role in the efficiency of search algorithms. Nega- 
tive effect of the high ad-hocness (high fi) can be eliminated 
by increasing the available information in rewiring, i.e., by 
increasing tj in both flooding and normalized flooding, see 
Figure [6] In some cases in normalized flooding higher ad- 
hocness yields better efficiency for enough high values of 
ti . Here, we do not present results for random walk search 
algorithm since the qualitatively they are not different than 
normalized flooding except that the random walk is more 
vulnerable to isolated clusters in the network. 
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Figure 5. Flooding (FL) performance over topologies generated with fc c =10 and m=3. 
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Figure 6. Normalized Flooding (NF) performance over topologies generated with various m, r ( , and 



ti values. 
6. Summary and Discussions 

In summary, we worked on an ad-hoc limited scale-free 
network model for unstructured peer-to-peer networks. We 
first developed localized joining and leaving schemes for 
the peers and measure the efficiency of search algorithms 
such as flooding and normalized flooding. By considering 
the fact that the peers do want to store too many links in- 
formation we also imposed a hard cutoff on the degree a 
node can have and analyzed its effect on the search effi- 
ciency. We parameterized the locality of the joining and 
leaving schemes by two parameters: tj (for joining) and 
t; (for leaving) which are the number of hops nodes will 
use to construct sets of nodes from which they will ran- 
domly choose other nodes and attempt to connect by using 
the preferential attachment rules and by observing the hard 
cutoff. Typically, high values of these parameters will make 
the network a preferential attachment network with degree 
distribution exponent 3. We also modeled the random dele- 
tion of the nodes by a probability parameter fi. 

Our search simulations show that the negative effects of 
the low cutoff and high probability of deletion can be eased 
by increasing the minimum degree in the network. This also 
helps one to avoid the pathological case of m=l for which 
the network will likely to have isolated clusters hindering 



the efficiency of the search algorithms. To remedy the neg- 
ative effects of high values of fi which destroys the scale- 
freeness in the network we enlarged the locality of the leav- 
ing scheme, i.e., increasing t; for a fixed Tj and cutoff will 
increase the efficiency of normalized flooding. Our findings 
are directly applicable to current unstructured P2P networks 
in which the peers leave the network unexpectedly and they 
have an upper limit for degree. 
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